test: Benchmarks by toby-coleman · Pull Request #160 · plugboard-dev/plugboard

toby-coleman · 2025-08-23T18:42:38Z

Summary

Adds a simple benchmark to the test suite.

Changes

Adds benchmark test on a simple 3-node model.
Reports benchmark results on each new PR, so that the code changes can be compared with main branch before merging.

gemini-code-assist

Summary of Changes

Hello @toby-coleman, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request introduces a new benchmarking capability to the test suite. Its primary goal is to enable performance regression testing by adding a simple benchmark for a 3-node model and configuring the system to report these results on new pull requests, allowing for direct comparison against the main branch before merging.

Highlights

New Benchmark Test: Implemented a new benchmark test for a basic 3-node Plugboard model to measure process execution performance.
Automated Benchmark Reporting: Configured the test suite to automatically report benchmark results on new pull requests, facilitating performance comparisons against the main branch.
Dependency Integration: Integrated the pytest-benchmark library into the project's test dependencies.
Test File Structure: Created dedicated files for organizing benchmark tests within the tests/benchmark directory.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.

Feature	Command	Description
Code Review	`/gemini review`	Performs a code review for the current pull request in its current state.
Pull Request Summary	`/gemini summary`	Provides a summary of the current pull request in its current state.
Comment	@gemini-code-assist	Responds in comments when explicitly tagged, both in issue comments and review comments.
Help	`/gemini help`	Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request introduces benchmarking to the test suite using pytest-benchmark, which is a great addition for performance tracking. The changes are well-structured, adding a new benchmark test for a simple 3-node model.

My review focuses on the implementation of the new benchmark test. I've suggested a refactoring to improve the clarity and conciseness of the test code, and to leverage pytest-benchmark's auto-rounding feature for more stable results. Overall, this is a valuable contribution to the project.

codecov · 2025-08-23T18:57:48Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

github-actions · 2025-08-23T18:58:57Z

Benchmark comparison for bbfd6d1f (base) vs 4917f5c5 (PR)


---------------------------------------------------- benchmark: 1 tests ----------------------------------------------------
Name (time in ms)                   Min       Max      Mean  StdDev    Median      IQR  Outliers     OPS  Rounds  Iterations
----------------------------------------------------------------------------------------------------------------------------
test_benchmark_process_run     858.1550  877.9909  866.9117  7.7217  865.4655  11.3752       2;0  1.1535       5           1
----------------------------------------------------------------------------------------------------------------------------

Legend:
  Outliers: 1 Standard Deviation from Mean; 1.5 IQR (InterQuartile Range) from 1st Quartile and 3rd Quartile.
  OPS: Operations Per Second, computed as 1 / Mean

github-actions · 2025-08-23T20:03:39Z

Benchmark comparison for bbfd6d1f (base) vs b5af6f72 (PR)


---------------------------------------------------- benchmark: 1 tests ---------------------------------------------------
Name (time in ms)                   Min       Max      Mean  StdDev    Median     IQR  Outliers     OPS  Rounds  Iterations
---------------------------------------------------------------------------------------------------------------------------
test_benchmark_process_run     867.2432  892.1093  875.9666  9.6154  874.9317  9.8430       1;0  1.1416       5           1
---------------------------------------------------------------------------------------------------------------------------

Legend:
  Outliers: 1 Standard Deviation from Mean; 1.5 IQR (InterQuartile Range) from 1st Quartile and 3rd Quartile.
  OPS: Operations Per Second, computed as 1 / Mean

chrisk314

lgtm

toby-coleman added 7 commits August 23, 2025 13:32

Add pytest-benchmark

5870a60

Add simple benchmark

7c1e28d

Marker not needed

e78c5ec

Ignore benchmark files

818e369

Add github action

159a8bb

Simplify

15649fe

Only comment on open

c22924d

gemini-code-assist Bot reviewed Aug 23, 2025

View reviewed changes

Comment thread tests/benchmark/test_benchmarking.py Outdated

toby-coleman added 4 commits August 23, 2025 19:44

Fixup

6eeb628

Add missing setup stage

d5ddcf4

Update createComment syntax

93f4dee

Remove marker

9249d51

Fix

4917f5c

plugboard-dev deleted a comment from github-actions Bot Aug 23, 2025

toby-coleman added 5 commits August 23, 2025 20:40

Try something different

b1b8969

Fix uv install

62544ac

Change order

6ab0a94

Fix

1a2834c

Fix comparison

06fe890

plugboard-dev deleted a comment from github-actions Bot Aug 23, 2025

Working?

b5af6f7

plugboard-dev deleted a comment from github-actions Bot Aug 23, 2025

chrisk314 approved these changes Aug 27, 2025

View reviewed changes

toby-coleman merged commit c32f38b into main Aug 28, 2025
18 checks passed

toby-coleman deleted the test/benchmarking branch August 28, 2025 12:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: Benchmarks#160

test: Benchmarks#160
toby-coleman merged 18 commits into
mainfrom
test/benchmarking

toby-coleman commented Aug 23, 2025

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

codecov Bot commented Aug 23, 2025

Uh oh!

github-actions Bot commented Aug 23, 2025

Uh oh!

github-actions Bot commented Aug 23, 2025

Uh oh!

chrisk314 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

toby-coleman commented Aug 23, 2025

Summary

Changes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

codecov Bot commented Aug 23, 2025

Codecov Report

Uh oh!

github-actions Bot commented Aug 23, 2025

Uh oh!

github-actions Bot commented Aug 23, 2025

Uh oh!

chrisk314 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants